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ABSTRACT 

We apply order statistics (OS) to the bright end {Mr < —22) of the luminosity 
distribution of early-type galaxies spectroscopically identified in the SDSS DR7 
catalog. We show that an overall normalized luminosity function can be derived 
from the data that describes the distribution of red elliptical galaxies sufficiently 
for the purposes of OS in a broad redshift range of 0.1 < 2 < 0.5. We calculate the 
typical OS quantities of this distribution numerically, measuring the expectation 
value and variance of the k*^ most luminous galaxy in a sample with cardinality 
N over a large ensemble of such samples. From these statistical quantities we 
explain why and in what limit the k*'^ most luminous galaxies can be used as 
standard candles for cosmological studies. 

As a sample application of OS we show that galaxy counts in different redshift 
ranges can be easily estimated if the absolute magnitude of the few most luminous 
galaxies and the overall shape of the luminosity function is known (and does not 
change significantly with z). First we demonstrate that the absolute magnitude 
of the k^^ most luminous early-type galaxies can be estimated from galaxy num- 
ber counts in the investigated redshift range. By reversing the method, galaxy 
counts can also be very easily calculated, for example, in redshift ranges where 
spectroscopic data is available only for the brightest sources. 

Since our sample contains all bright galaxies including the brightest cluster 
galaxies (BCG), based on OS we argue that BCGs can be considered as statistical 
extremes of a well-established Schechter luminosity distribution when galaxies 
are binned by redshift and not cluster-by-cluster. We presume that the reason 
behind this might be that luminous red ellipticals in galaxy clusters are not 
random samples of an overall luminosity distribution but biased by the fact that 
they are in a cluster containing the BCG. We show that a simple statistical toy 
model can reproduce the well-known magnitude gap between the BCG and the 
second brightest galaxy of the clusters. 
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Subject headings: galaxies: elliptical and lenticular, cD - galaxies: luminosity 
function, mass function - galaxies: evolution - galaxies: interactions 



1. Introduction 

The physical properties of luminous red galaxies (LRG) make them suitable to select a 
consistently evolving sample in a broad redshift range. LRGs are the most massive stellar 



imately redshift z ^ 1.2 with current observat i onal techniques (lYork et al. 
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Thanks to the significant break in their spectrum at 4000 A it is relatively easy to de termine 



their redshift even from spectra observed at low S/N ratio ( lEisenstein et al.ll200ll ). LRGs 



lar population has been passively evolving since then ( 
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Vlaraston et al. 


2009). We used a consistent sample of luminous 



red galaxies taken from the Sloan Digital Sky Survey DR7 to demonstrate the statistical 
methods subject to this paper. 

Order statistics (OS; in the special case of = 1 first order statistics or extreme value 
statistics - EVS) is a field of statistics concentrating on determining expectation values and 
moments of extrema and the k^^ largest (smallest) value of samples of a given cardinality 
drawn from an underlying probability density distribution. Expectation values and higher 
moments of statistical quantities can be expressed as functions of the sample size N and the 
order k. While these calculations are complex analytically, numerical integration of the for- 
mulae is very simple by the Monte-Carlo method which is essentially equivalent of computing 
the statistics of mock catalogs generated by using a predefined luminosity function. Extreme 
value theory applied to independent and identically distributed or weakly correlated vari- 
ables yields several theorems about limiting distributions of extrema: the behaviour of the 
tail of the underlying distribution determi nes the kind of the distri b ution of ex t reme values 
and only a few of these exists as shown by lFisher fc TippettI (Il928l ): iGnedenkd (jl948( ). 



The brightest LRGs can be found in the central regions of galaxy groups and clus- 
ters and have been successfully used as standard candles because of the small scatter in 



their intrinsic lurninosity, once correction fo r their evolution is applied (jSandagelll972l . Il976 



Postman fc Lauerlll995l : IWhiley et al.ll2008l ). It is a widely discussed question whether these 
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galaxies can be considered statistical extremes of the same luminosity distribution as the rest 
of the early-type galaxies or they belong to a different distribution ( Geller fc PeeblesI 



2007 



Liu et al 



2008 



1976 



Geller fc Postmanlll983[lLoh fc Strau3 2006: IPe Lucia &: Blaizotll2007l : lvon der Linden et alj 



Lin et al.ll2010l ). The latter case would suggest certain selection effects 



and different physical processes in their formation and evolution. One of the main statistical 
arguments against that they belong to the same luminosity distribution as the rest of early- 
type galaxies is the magnitude gap of AM ^ 0.8 mag betw een the BCGs and the second 
brightest galaxies of the same clusters (ILoh fc Straussll2006l ). 



Earlier astrostatistical studies applied EVS to investigate the distribution of the bright- 
est cluster galaxies and claimed that the distribution of the brightest cluster galaxies is 
different from those of brightest group galaxies. It was stated that the luminosity distribu- 
tion of the latter follows the extreme value distribution of an un derlying distribution with 



tail extending to infinity b ut decaying faster than any power law (IBhavsar &: Barrowlll985 



Bernstein fc Bhavsarll200ll ). They did not take into account however, that for small values 
of sign ificant corrections ap p ly to the theoretical lirn i ting e xtreme value distributions as 
shown by iGyorgyi et al.l (120081 ) ; iTaghizadeh-Popp et al.l (120 lOl ) . 



The structure of this paper is as follows. In Sec. [2] we describe the dataset we used and 
our method to calculate the absolute magnitudes. In Sec. [3] we determine the luminosity 
distribution of our sample. We give a short introduction to the basics of the area of extreme 
value and order statistics in Sec. H] and calculate the most interesting OS quantities of the 
luminosity distribution of our sample. In Sec. O we demonstrate the applicability of OS to 
determine galaxy counts at high redshifts where only the brightest galaxies were observed. 
In Sec. [6] we describe a very simple model that can reproduce the megnitude gap between 
the BCG and the second brightest galaxy of clusters. In Sec. [7| we discuss the capabilities, 
limitations and relevance of EVS/OS in cosmological investigations including the quest for 
the brightest cluster galaxies. 

Throughout this paper we adopt a fiat ACDM cosmology with h = 0.7, Qm = 0.3 and 
Q\ = 0.7. Where not otherwise noted, absolute magnitudes are considered K- and evolution- 
corrected and calculated from extinction corrected best radial fit model magnitudes. 



2. The data 

As we based our sample selection on a stellar population evolution model and K- and 
evolution-corrected absolute magnitudes it is important to discuss this model before describ- 
ing the selection criteria. 
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2.1. K-correction and evolution correction 



Since the mass to light ratio of passively evolving stellar populations changes heavily 
with time it is very important to correct for stellar evoluti on when comparing ga l axy counts 
at different epochs based on thei r mea sured magnitudes flEisenstein et al.l 1200 ll : iLohl l2004l : 
Loh fc StraussI l2006l : IWake et al.ll2006l ). To correct for stellar evolution and determine a 
co mparable absolu t e mag nitudes we adopted the best-fit composite stellar population model 
of iMaraston et al.l (120091 ) . This semi-empirical model describes the evolution of the SDSS 
g — r and r — i colors of red sequence galaxies fairly well in the required redshift range. The 
model assumes that the stellar populations of the red sequence galaxies would be equally 
12 Gyr old at redshift 2; = and they have been evolving passively since they had assembled 
at redshift z >2. The original model spectra were computed in 1 Gyr increments, we used 
linear interpolation between those time steps. Fig. [1] shows the evolution of a K-corrected 
(i.e. not redshifted) passively evolving stellar population in the SDSS g', r', i' bands. 



Following the recipe in iMaraston et al.l (120091 ) we computed t he value of K-correcti on for 
the M, g, r, i and z filters of SDSS using the transmission curves (IFukugita et al.lll996l ) from 
the SDSS web site for every galaxy as if they were at redshift z = 0. For evolution correction, 
we computed the flux ratios of a model at the observed redshift with age ti = 12 Gyr — tib 
and a model at 2; = with age ^2 = 12 Gyr, where tif, is the cosmic look-back time. 

Adopting a constantly evolving model that does not allow for scatter in the age of stellar 
populations at a given redshift seems to be too simplified but it turned out to work better 
than fitting models of varying age to the individual galaxies. Former studies focusing on 
early-type galaxi es concluded that th e dispersion in the age of high-mass ellipticals is in the 
100 Myr regime jGallazzi et al.ll2006h . Also, the scatter in K -|- e corrected absolute mag- 
nitudes is significantly higher when each galaxy's age is fitted individually. This is thought 
to be due to the degeneracy in photometric error scatter, and scattering due to age and 
metallicity around the magnitudes computed from the model spectra. Thus, the constantly 
evolving model works well for the vast majority of the brightest galaxies with passively 
evolving stellar populations. Also, galaxies experiencing episodic quiet star formation due 
to minor merging or smooth accretio n are thought to h ave very similar optical colors as 
their completely passive counterparts (IKaviraj et al.ll2008l ). The model cannot be applied to 
those galaxies that recently evolve d onto the red sequenc e via merging or quenching of star 
formation from the blue sequence (IBell et al.ll2004l . l2007l ). However, the latter galaxies are 
thought to have the luminosity below our absolute magnitude limits. 
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2.2. Notes on the evolution correction 



We would like to emphasize the importance of right evolution correction of the magni- 
tudes of early-type galaxies in order to obtain physically meaningful results. As we observe 
galaxies at higher and higher redshifts, their colors become redder due to the shifting of the 
observed wavelength of the 4000 A break. On the other hand, as we look further out we see 
younger stellar populations that tend to be intrinsically bluer and brighter. These two effects 
compete in the SDSS r and i bands and they almost cancel out each other between z = 
- 0.35 and z = - 0.6 respectively, as can be seen in Fig. [2l The difference between pure 
K-correction and K+e-correction can be as big as 0.5 mag which could distort the absolute 
magnitude limited co-moving density estimates significantly. 

Forthcoming large photometric surveys like PanSTARRS and LSST will not measure 
spectroscopic redshifts. Hence, reliabl e photometric redshift esti mation of strongly evolving 
galaxies will be crucial in the future ( iPadmanabhan et al.ll2005[ ). Many current, template- 
based, widely used photometri c redshift estimation algorithms do not take stellar population 
evolution into account directly ( Bolzonella et al.lboOO : Blanton et al.llioOS : Blanton fc Roweis 
20071 : iHildebrandt et al.ll2010l ). Certain algorithms solve this problem by using a numerous 
set of LRG templates with varying ages and attempt to fit the age from photometry alone 
without making an assumption on the covariance of the age and redshift. This is not a 
good practice in case of LRGs as photometric scatter and scatter from age and metallicity 
differences cannot be easily separated from optical broadband photometry. Also, using too 
many templates introduces too many degrees of freedom in fitting. 

Empirical or semi-empirical template-based algorithms however are able to achieve 
good photometric redshift estimates by "fixing" the spectrum templates, by fitting them 
to the data, producing slightly distorted template s that have some sort of effective evolu 



tion encoded into th e overall curve of the spectrum (ICsabai et al.ll2000l : iBudavari et al.ll2000 
Blanton et ablboosh . 



2.3. Our subsample of SDSS DR7 spectro 



For this study we used the SDSS DR7 spectroscopic data set, spectro hereafter (jAbazajian et al. 



20091 ). To create our early-type galaxy sample we selected all galaxies from spectro which 
had the best cross-correlation redshift confidenc e (but the minirnum o f Zconf > 0.9) with 
the LRG template of the SDSS spectro pipeline (lEisenstein et al.ll200ll ). Since fibres of the 
SDSS spectrograph cover only the central 3" of extended sources, encompassing only the 
bulge of nearby spirals, low-redshift late-type spectra can closely resemble those of early- 
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type galaxies. Though bulges of spirals are much fainter than LRGs, they can affect number 
counts calculated with lower magnitude limits. This issue can be partially resolved by taking 
observed photometric magnitudes and radial profiles into account which help to discriminate 
between bulges of spirals and elliptical galaxies. Consequently, we also required that the se- 
lected ga laxies must be close en ough to the trajectory defined by the evolving fiducial LRG 
model of iMaraston et al.l ( l2009l ) in the g — r; r — i; i — z color-color space. The following 
constraint was set. 



^{{9 -r)- {gm - rm)f + ((r - i) - {r^ - im)Y 



((, _ ^) _ _ ^^))2 < 0.015, (i: 



where g, r, i and z are the observed, dereddened best-fitting model magnitudes, and g^, 
im and z^ are the synthetic magnitudes of the stellar population model along the trajectory 
(refer to Sec. 12. li on the model used). 

Also because of the 3" spectroscopic fibre radius, additional constraints must have been 
set to restrict source selection to elliptical galaxies only. We required that all galaxies must 
have good de Vaucouleurs' radial profile fits in the r b and (fraction of t he de Vaucouleurs' 



function in the total radial profile fit is over 80%, refer to lStoughton et al.l (120021 ) for details on 
surface brightness fitting in SDSS) and that all galaxies have reliable redshift measurements 
(with over 90% confidence). 

We verified the most luminous 1,000 galaxies of the automatically selected sample by 
eye and excluded the outliers from the data set. The exclusion rate was about 1.5% and 
the reason of exclusion usually was the closeness of a very bright object or "ghost" which 
confused the SDSS photo pipeline leading to invalid magnitudes. The selection process 
resulted in a sample of 142,762 galaxies ranging to 2; = 0.63 in redshift and covering 8,032 
square degrees of the sky. The brightest part of the sample (M^ < —22) contains 73,459 
galaxies. We estimated the spectro completeness factor as 0.9 in the density calculations. 

The uniformity of our sample is not perfect below z < 0.07 due to the well-known prop- 
erties of the SDSS: The SDSS photometric reduction algorithm might erroneously deblend 
galaxies with very large apparent diameter into two when more than one high surface bright- 
ness peak is detected within the same source. As we are interested in the biggest galaxies 
this issue can significantly effect the n umber counts at very low redshifts. The total LRG 



sample is volume-limited to 2; = 0.38 ( iLoh Sz StraussI l2006l ) but all galaxies brighter than 



Mr < —22 were observed up to redshift z = 0.45, galaxies brighter than M,. < —22.5 up to 
z = 0.51. 
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3. The LRG luminousity function 

In this section we estimate the magnitude distribution of our LRG sample and fit the 
parameters of the Schechter function in the form of 



0(M) dM = 0oexp{C(a + l)(M-M*)- 

- exp [C{M - M*)]} dM, (2) 

where C = -0.4 log 10. 

In Fig. [3] and m we plot the luminosity distribution of our spectroscopically selected sam- 
ple in redshift bins of Az = 0.02. The magnitudes are corrected for redshift and evolution. 
Because the co-moving density of early-type galaxies grows slightly with cosmic time the 
higher curves belong to lower redshifts. The cut-offs fainter than M,. > —22.5 are due to the 
magnitude limit of the sample, but we are only interested in the bright-end slope of these 
curves. Although the curves on Fig. |4] run together reasonably well without any rescaling, 
in order to get an overall bright-end luminosity function that is valid over a broad redshift 
range, these curves need to be scaled together by compensating the density growth. 

For reference, we also plot the luminosity distribution of the same galaxies with only K 
(but not evolution) corrected magnitudes in Fig. [51 Here curves do not overlap automatically. 

Fig. E] shows the change of the co-moving density of the brightest part of the early- 
type galaxy population. To get a volume-limited sample to as far as redshift z = 0.5, only 
galaxies having the r-band absolute magnitude between —22.5 > Mr > —23.5 are counted. 
The cut-off in density aX z = 0.5 is due to the magnitude limit of the sample, while the cause 
of the cut-off at low redshift is explained in Sec. 12. 3[ 

The evolution of the co-moving number density only affects 0o- In the rest of the 
paper we used a normalized luminosity function that is independent of the galaxy density 
and show that the presented statistical method only depends on a and M*. In order to 
scale the luminosity distribution curves of Fig. S] together and get Fig. [71 we estimated the 
change of galaxy number density as a weak exponential function of redshift. We scaled the 
luminosity distributions of the individual redshift bins such a way that they all had the same 
co-moving density as the redshift bin aX z = 0.2. The result of the fitting in the redshift 
range of 0.2 < 0.5 is also plotted in in Fig. [6]) with the slanted dashed line. The vertical 
lines represent the redshift range used for fitting. The fitted parameters of the function are 
the following. 
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n(z] 



■]^g(-1.02±0.06) ,2+(-5.16±0.02) 



(3) 



This means a growth of about 0.3 dex i n galaxy density f rom z = 0.5 t o z = 0.2, which 



is consistent with the results of others (e.g. iBell et al.l ( 120041 ): iFaber et al.l (120071 )) 



The results of the normalization is plotted in Fig. [71 The curves run fairly well together 
at the bright end, but scatter at the very high luminosities is due to the very low number 
counts. The bright end of the luminosity distribution curves can be scaled together for 
our purposes because of thei r weak dependence on a and M*, which are otherwise known to 
evolve slightly with redshift ( iBell et al.ll2004j : iFaber et al.ll2007l ). These curves were averaged 
to get an overall luminosity distribution that is valid over a broad redshift range. 

Fig. [H] shows the averaged luminosity distribution (solid line) and the fitted Schechter 
function in the form of Eq. [2l The fitting was done in the —22.5 < Mr < —23.5 range 
by keeping the faint-end parameter a fixed at a = —1.2. This was done because a cannot 
be estimated sufficiently well from the brightest galaxies only. The resulting values of the 
parameters are a = -1.20, log^o (0o) = -7.06 ± 0.21, M* = -21.10 ± 0.04 where is 
calculated with respect to z = 0.2. We will use this luminosity distribution in the following 
sections to give an introduction to order statistics. 



4. Extreme Value and Order Statistics 

4.1. A brief introduction to EVS/OS 

Let us consider the probability density function f{x). We are looking for the expectation 
value of the maximum (or minimum) of a sample of N numbers drawn from this distribution 
randomly. The cumulative distribution function belonging to f{x) is defined as usual: 



F{x) 



f{u) du 



(4) 



hence, the probability of a number x drawn from the distribution being less than a given X 
is simply 



P{x <X) = F{X). (5) 

Consequently, if we have a sample of N independently drawn numbers {xi,X2, ...,X]y} 
we can calculate the probability of X^ = max{xj} being an upper limit of the sample as the 
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probability of all numbers in the sample being less than Xm- 

PUXJ = P{x, < = P^(x < XJ = F^(X^). (6) 

Differentiation with respect to X^ yields the probability density function of the maximum 
of a sample of size of A^: 

N) = iVF^-i(X^)/(x), (7) 

where f{x) is the underlying (original) probability density function. All we have to do is 
to calculate the expectation value (or any higher moment) of Xm drawn from pm{Xm, N) 
which, in case of a general f{x) probability function and small A^, is not possible analytically. 
It is very easy to calculate the integral numerically by the Monte-Carlo method, however. 

Following the same pattern one can determine the probablity distribution, expectation 
value and moments of the 2"''^, S*""^, etc.: the k*^ largest value of the sample. The probability 
distribution of the k^^ largest value is the following. 



P(fc)(X(fc),iV) 



{k-l)\{N-k)\ 



-^i^"^-^^(.))/(X(.)) 



The first theorem of extreme value statistics (Fisher-Tippet-Gnedenko theorem) states 
that in general the maximum (minimum) of a sample of numbers converges to one of the 
three possible distributions depending on the behaviour of the tail of the distribution from 
which the samples were drawn. This is considered the equivalent of the central limit theorem 
for extremes. Underlying distributions with exponential tails will all have the same extreme 
value distribution in the i V — )■ oo limit (once no r malized for the first two morn ents): the 
Gumbel distribution g{x) JFisher fc Tippettlll928l : iGnedenkolfigisI : lGumbelll958h . 



(9) 



It is important to emphasize that the Gumbel distribution is only applicable in the 
high-cardinality limit. Because the convergence is slow (usually logarithmic in A^), and its 
speed also depends on the underlying distribution f{x) at small sample sizes, significant 
corrections apply to the double exponential. These corrections also depend on f{x) and 
analytic formulae ar e hard to calculate; numerical simulations are usually simple, however 
dGvorgvi et aPboOsh . 
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While other fields of science use extreme value statistics to calculate the probability of 
rare, catastrophic events (usually strong earthquakes, fioods, stock exchange crashes etc.) in 
astronomy we do see these rare "events": the brightest objects of a certain kind. Because 
of the magnitude limit of the telescopes over a certain redshift only the most luminous 
galaxies, quasars, gamma ray bursts etc. are observable. Extreme value statistics might 
allow us to infer further information about the number of the undetected objects solely from 
the brightest detections if we have a model on the luminosity distribution of the objects. 

4.2. Applying EVS/OS to the early-type galaxy luminosity function 

In this section we show an example on how EVS/OS is applicable to extrapolate early- 
type galaxy counts beyond the redshift where the magnitude limit of the spectroscopic sample 
starts to affect the measured galaxy counts. 

Although the Schechter function, if written as a function of the magnitude, is not a valid 
probability density as it is not capable of being integrated over all magnitudes, it becomes 
integrable and can be normalized whenever a lower magnitude limit is introduced. 

We took the analytic fit to the measured, rescaled and averaged r-band luminosity 
distribution (as explained in Sec. EJand plotted on Fig. |H]), applied a cut-off of < —22 mag, 
and ran the Monte-Carlo integration to get the expectation value of the maximum luminosity 
(minimum magnitude) as a function of the sample size A, and also the expectation values 
of the 2""^, S*'^, lO*'*, 20*^* and lOO*'^ most luminous galaxy. The results are plotted in Fig. |9l 
As the plot shows, the expectation value of the luminosity raises rapidly with A for small 
values of A and turns into a slowly raising curve at higher sample size. For the Schechter 
parent distribution, these curves take the generic form of In In A for large enough A because 
around the expectation value of the maximum Lm the following is true. 




(10) 



where 5 is the unit interval on the scale oi L/L*. For ^ L* this yields 



L 



m 



lnA + 0(lnlnA). 



(11) 



L* 



The second logarithm comes from the logarithm of the magnitude scale. The very 
slowly rising behaviour of these In In A curves is the reason behind t he fact that the bright- 
est cluster galaxies can be used as standard candles (for example ISandagd (119721 Il976l ): 
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Postman 8z Lauerl (119951 ): IWhiley et al.l (j2008[ )). (Here we assume for a moment that BCGs 



are drawn from the same distribution as the rest of early-type galaxies. We will discuss 
this issue later.) The expectation value of the luminosity is evidently decreases with k, the 
dependence is also logarithmic. 

The standard deviation of M(fc) shows a more interesting behaviour that limits the 
usability of EVS/OS for high-precision studies and also explains some well-known rules of 
thumb of astronomy. In Fig. [TU] we plot the standard deviation of the expectation value of 
the magnitude of the P*, 2"'', S*'^, 10*'', 20*'' and lOO*'' most luminous galaxy as a function of 
the sample size N. By looking at the individual curves, it is clear that the variance decreases 
only slightly with bigger sample size N, but changes significantly with higher order k. This 
is the reason that, for example, the magnitude of the brightest cluster galaxies changes more 
from cluster to cluster than the magnitude of the second, fifth etc. brightest galaxy and why 



they are better standard candles than BCGs (jScottI 119571 ). 



In Fig. [m we plot the standard deviation of M(^k^ as a function of k for four different 
A^. As the plot shows, the variance decreases very rapidly with k and the curves have an 
inflexion point around k ^ A^/2. Above k ^ A^/2, the variance decreases only mildly with 
k. This means that if we are able to measure the apparent magnitude and redshift of 5 - 
10 % of the most luminous objects in a certain area of the sky or in a certain redshift bin, 
we can find standard candles with an accuracy of 0.2 mag. The actual directly measured 
number counts in our redshift bins at higher redshifts are around 1000. Consequently, if we 
want to apply EVS/OS techniques to determine the number count of the fainter galaxies 
successfully in a certain redshift range, it would be necessary to measure the spectroscopic 
redshift of the brightest 50 - 100 galaxies per redshift bin. This targeting process requires 
accurate photo-z and absolute-magnitude estimates. 

For reference, we also plot the expectation value of M(fc) as a function of k for four 
different A^ in Fig. [T2j The important thing to observe here is that the curves actually reach 
the magnitude limit of Mr = —22 in the limit oi k N (only visible for A^ = 100 and 
A^ = 200). This is only true for underlying probability density functions with a hard cut-off. 

In Fig. [12] we plot the distribution of the magnitude of the k^^ most luminous galaxies 
drawn from a distribution based on the early-type luminosity function for A; = 1, 5, 10 and 20. 
The sample size was fixed at A^ = 100 and the simulation was run 500,000 times. The curves 
show distributions typical to order statistics with slightly asymmetric probability density 
functions. For k = 1 and N ^ oo the distribution should be the Gumbel distribution, 
for /c — > oo and N ^ oo the limiting distribution is Gaussian. The actual k = \ cannot 
be fitted very well with the theoretical Gumbel distribution because of the small A^, as 



discussed in Sec. 14.11 convergence to the Gumbel distribution is slow flGyorgyi et al.l 12008 
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Taghizadeh-Popp et al.ll2010[ ). The FWHM of the k = 1 curve is consistent with what we 
know about BCGs, i . e. the scatter in their absolute magnitude is around AM ~ 0.25 mag 
(ISandagelll972l . Il988l : lAragon-Salamanca et al.lll998[ ) from cluster to cluster. 



5. Estimating galaxy counts from the brightest LRGs 

To confirm our assumption that order statistics can be used to accurately estimate the 
brightest galaxies' absolute magnitude solely from number counts based on a single, non- 
evolving luminosity function, we refer to Fig. HH Here we plot the magnitude of the k^^ 
brightest galaxy in each redshift bin for two different bin sizes: big - Az = 0.02, and small 
- Az = 0.005. The black circles represent the brightest magnitudes of the big bins (thus 
they are usually brighter than in the small bins) while the open rectangles represent the 
brightest magnitudes of the small bins. The curves represent the expectation value of M(fc) 
as computed from the number counts in the redshift bins (thick line - big bins, thin line - 
small bins) based on order statistics. For the small bins, we also plot the 2a range (shaded 
area arount the thin curve). The curves computed from number counts agree reasonably well 
with the actual luminosity measurements in the redshift range where all galaxies brighter 
than Mr < —22 were observed. 

By inverting the functions plotted in Fig. [HI one can easily calculate the galaxy counts 
in a redshift bin from the magnitude of the k^^ brightest galaxy of the bin. Fig. [15] shows our 
estimate of galaxy density for six different values of /c as a function of z. For low values of 
k the estimates show very large scatter, but the fluctuations of the estimate for k = 100 are 
already comparable to the fluctuations of the directly measured galaxy density. These plots 
also show the extrapolation capabilities of the method. Directly measured galaxy counts 
go down around z ~ 0.45 because of the apparent magnitude limit of spectroscopic target 
selection, but the brightest galaxies are still in the sample up to redshift z = 0.55 - 0.6, and 
allow us to calculate galaxy counts for this redshift range too. 

We emphasize the extrapolation capabilities of our method to redshift ranges where 
the magnitude limit of the telescope or the target selection criteria forbid computing galaxy 
density directly in Fig. [161 The plots show OS estimates on the galaxy number density for 
six different absolute magnitude cuts from the brightness of the 100*^ most luminous galaxy 
in redshift bins of Az = 0.005. For low magnitude cuts, the target selection effects evidently 
determine the directly measured number counts. For brighter cuts, the OS estimates nicely 
converge to the measurements. It is important to note that we used the same luminosity 
function to calculate the OS quantities; only the integration limit was changed. 
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Statistics of the BCGs 



investigated by iLoh fc StraussI (120061 ) ; 



It lias bee n thought for d e cades f Presslerl Il976l: lOstriker fc HausmanI 119771 ). recently 



Lin et al.l ( 120101 ) that the brightest central galaxies 



of the clusters cannot be explained as extreme values drawn from the same luminosity 
distribution as the rest of the galaxies in rich clusters. The main argument was that the 
observed magnitude gap of M12 — 0.8 mag between the brightest and the second brightest 
member of a galaxy cluster cannot be explained if both galaxies are drawn randomly from 
the same underlying luminosity distribution. On the other hand, as we demonstrated in 
Sec. 14.21 and Sec. \5\ this is not necessarily the case if we look at the whole LRG population 
without considering galaxies to be part of clusters. The absolute magnitudes and the scatter 
of the magnitudes of extremely luminous red galaxies seem to be well-described by the single 
Schechter function that can be derived from the total ensemble of LRGs once the correct 
statistical method is applied. 

As we demonstrated in Sec. 14. 2^ the expectation value of the magnitude difference 
M12 = M(i) — M(2) between the brightest and the second-brightest galaxy of a random 
sample drawn from the LRG luminosity function is well below (M12) < 0.2 mag, cf. Fig. [9l 

In order to resolve this contradiction we investigated the statistics of the following very 
simple model. We populated uniformly a 1 Gpc^ volume with galaxies at a number density 
of 10-^-5 Mpc"l Next we assigned absolute magnitudes brighter than M < —22 to all of 
these galaxies. The magnitudes were randomly drawn from the LRG luminosity function we 
determined previously. We considered every galaxy a BCG if it had the absolute magnitude 
brighter than M < —23. For every such BCG, we determined the magnitude of the second 
and third brightest galaxy within 10 Mpc. The distribution of the magnitude difference 
between the BCGs and the second and third brightest galaxies is plotted in Fig. [T71 The 
expectation value of the gaps are M12 — 0.75 and M13 ^ 1.0, in good agreement with the 
observations. 

The model is very primitive compared to our knowledge about the large-scale structure 
of the universe, but one can consider the randomly positioned galaxies of the real space as 
the centers of the main progenitor DM halos of future massive galaxies at some very early age 
of the Universe when the mass distribution was homogeneous on small scales. The 10 Mpc 
cluster radius can be imagined as the radius of the DM halos from which the central region 
of a future cluster will form. Of course, the final cluster will have a much smaller radius due 
to the merging of these halos. 
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7. Discussion and summary 

We have shown in Sec. |5]and in Figs. [T^llT6] that EVS/OS can be successfully applied to 
the LRG sample of SDSS and yield consistent magnitudes and galaxy counts, with significant 
extrapolation capabilities if selection effects dominate the sample. Since the SDSS data set 
covers a significant part of the sky, we evidently included galaxy clusters of all sizes in the 
sample. Consequently, the sample contains all BCGs of these clusters. This raises two 
important questions that we would like to discuss in this section. 



7.1. The effect of clustering 

In order to demonstrate the extrapolation capabilities of the presented technique, we 
chose to bin our data in redshift bins instead of spatial bins, and all of these bins contained 
very bright cluster galaxies, because SDSS covers a very large area on the sky. What would 
happen if we restricted our sample to areas of the sky where only fainter galaxies are present? 
From fainter maximal luminosities, order statistics would estimate galaxy counts to be ac- 
cordingly lower. A fainter sample (in terms of absolute magnitudes) means fewer galaxies 
since the sample was probably targeted to avoid dense clusters. On the other hand, samples 
targeting brighter galaxies would cover dense groups and clusters with lots of galaxies; con- 
sequently, galaxy counts would be higher. EVS/OS can only be applied to samples covering 
a big enough area of the sky to cancel the bias caused by clustering. 



7.2. Brightest cluster galaxies 

Our resolution to the controversy about the magnitude gap experienced in case if BCGs 
is that clusters cannot be considered random samples in which galaxies are drawn indepen- 
dently from any distribution. Samples containing a BCG are special. We have showed in 
Sec. Elthat if spatial distribution of galaxies is also taken into account, one can very easily 
reproduce the measured statistics of clusters solely on the basis of a single LRG luminosity 
function. This result suggests that BCGs are indeed formed the same way as the rest of 
the LRGs, and the scenario of galactic cannibalism can be ruled out. However, our model 
is extremely simple and the statistics of BCGs requires more investigation base d on N-body 



simulations and merger trees. We will publish our results in a future paper (IDobos et al. 
201lh . 
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7.3. Possible application of order statistics based estimations in future surveys 

The demonstrated statistical methods highly depend on accurate absolute-magnitude 
measurements and spectroscopic redshifts. Upcoming large surveys like PanSTARRS and 
LSST will provide deep photometric datasets without any directly measured redshift infor- 
mation. The intrinsic e rror of photometric redshift methods is about Az ~ 0.01 - 0.02 



( iHildebrandt et al.l 120101 ). This is much larger than our redshift bin size, so redshift mea- 
surements with significantly better accuracy are required to apply our method with the 
same resolution. In light of extreme-value and order statistics, besides aiming for a complete 
spectroscopic follow-up survey of the large photometric surveys, it might be also useful to 
calculate preliminary redshifts based on photometry and target the brightest galaxies per 
redshift bin first for spectroscopy. This would reduce the number of targets significantly 
(a couple of thousands instead of hundreds of thousands), and still galaxy counts could be 
calculated at very far distances, even where only the brightest galaxies are observable. 

The k^^ brightest galaxies (for large values of k) are useful standard candles which can 
be used to test cosmological models. However, in case of early-type galaxies, this method is 
limited by the quality of the evolutionary model used to compute absolute magnitudes, just 
as by the model of the evolution of a and M* of their luminosity function. 
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Fig. 1. — K-corrected absolute magnitudes (SDSS g', r', i' band, from bottom to top) of a 
passively evolving model of an early-type stellar population with an age of 12 Gyr ai z = 0, 
normalized to 1 Mq. The exponential change of luminosity of early- type galaxies with age is 
clearly visible and the effect has to be corrected if one wants to compare galaxies based on 
their absolute magnitudes. (Note the inverted scale.) 
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Fig. 2. — K-correction (solid line) and K+evolution correction (dashed line) in the SDSS r' 
band (top panel) and i' band (bottom panel) for a passively evolving early-type galaxy with 
an age of 12 Gyr at ^ = 0. Note that the K+e correction is almost zero out to z = 0.35 and 
z = 0.65, respectively. 
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Fig. 3. — Luminosity distribution of the early-type galaxies in redshift bins of Az = 0.02 
between 0.14 < 2; < 0.50. The central redshift of the bins is indicated in the top right corner 
of each plot. The bright end of the sample (M^ < —22.5) is volume-limited in the whole 
redshift range. The low luminosity turn-offs are due to the limiting magnitude of m,, < 19.5 
of the spectroscopic targeting algorithm of SDSS. See Fig.Hlfor the same plot with the curves 
plotted over each other. 
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Fig. 4. — Luminosity distribution of the early-type galaxies in redshift bins of Az = 0.02 
between 0.15 < z < 0.49. The bright end of the sample [Mr < —22.5) is volume-limited in 
the whole redshift range. The low luminosity turn-offs are due to the limiting magnitude 
of rUr < 19.5 of the spectroscopic targeting algorithm of SDSS. The vertical dashed lines 
mark the magnitude interval in which galaxies are counted in order to scale the luminosity 
distribution curves together (see text). The bright ends of these curves run together reason- 
ably well without any rescaling but correction for the small change in co-moving density is 
necessary to derive an overall luminosity function (cf. Fig. [7]). 





Fig. 5. — Luminosity distribution of the early-type galaxies in redshift bins of Az = 0.02 
between 0.15 < 2 < 0.49. Only K-correction (and not evolution correction) was applied to 
the magnitudes (cf. Fig. H]). 
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Fig. 6. — Co-moving density of the most luminous (M^ < —22.5) early-type galaxies as a 
function of redshift. The number density decreases slowly with z between z < 0.15 < 0.5. 
The turn-down at low redshift is thought to be due to the bright-end incompleteness of the 
sample, see Sec. 12.31 The fitting was done between 0.2 < z < 0.5, as marked with the vertical 
lines. 
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Fig. 7. — Luminosity distribution of the early-type galaxies in redshift bins of Az = 0.02 
between 0.15 < z < 0.49 scaled to the density of galaxies brighter than < —22.5 at 
z = 0.2. The scaling was applied based on the co-moving density evolution described in 
Sec. [3] and plotted on Fig. |6l The curves now overlap very well in the —22.5 > Mr > —23.0 
magnitude range. The scatter at higher luminosities is due to very low galaxy counts. 
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Fig. 8. — Luminosity distribution of the early-type galaxies normalized and averaged over 
redshift bins of Az = 0.02 between 0.15 < 2; < 0.49. The shaded area represents the standard 
deviation of the different curves plotted in Fig. [71 The dashed line is the analytic fit to the 
data. The averaged distribution is a good representation of the magnitude distibution of the 
galaxies for < 22.5. The sample is incomplete below this luminosity for higher redshift. 
Fitting with the Schechter function was done in the —22.5 > Mr > —23.5 range, as marked 
with the vertical lines. 
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Fig. 9. — Expectation value of the absolute magnitude M(fc) of the P*, 2"^^, 5*^^, 10*^, 20*^^ 
and 100*'' (from top to down) most luminous LRG (brighter than M^. < —22) as a function 
of the sample size A^, computed directly from the luminosity function using the Monte- 
Carlo method, averaged over 5,000 runs. The vertical dashed lines mark the sample sizes of 
N = 100, 200, 500 and 1, 000 which are used to plot Fig. [TT]and Fig. [121 The curves show a 
log log behaviour. 
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Fig. 10. — Standard deviation of the absolute magnitude M(fc) of the 1"**, 2 , 5 , 10 , 
20*^^ and 100*'* (from top to down) most luminous LRGs (brighter than < —22) as a 
function of the sample size A^, computed directly from the luminosity function using the 
Monte-Carlo method, averaged over 5,000 runs. The vertical dashed lines mark the sample 
sizes of iV = 100, 200, 500 and 1,000, which where used to plot Fig. [ID and Fig. M The 
curves change only slightly with for > 100. 




Fig. 11. — Standard deviation of M(fc) in a random sample of = 100, 200, 500 and 1000 
galaxies as a function of k. An inflexion of the curves is observable around k ^ A^/2; above 
this, the variance does not change significantly with k. 
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Fig. 13. — Probability distribution of the magnitude of the first, fifth, tenth and twentieth 
most luminous galaxy in random samples of cardinality = 100. Results from a Monte- 
Carlo simulation of 500,000 runs. 
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Fig. 14. — Absolute magnitudes of the first, second, fifth, tenth, twentieth and hundredth 
most luminous galaxy, as functions of redshift in redshift bins of t^z = 0.02 (black circles) 
and Az = 0.005 (open rectangles). The solid curves show the expectation value of M(fc) as 
calculated from the measured galaxy counts in the redshift bin based on extreme value statis- 
tics. The thick lines represent magnitudes calculated for the big redshift bins, while the thin 
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Fig. 15. — Number density of galaxies brighter than Mr < —22 in redshift bins of A2; = 0.005. 
The points represent the predictions made from the absolute magnitude of the first, second, 
fifth, tenth, twentieth and hundredth most luminous galaxy of the redshift bin based on 
extreme value statistics. Solid lines (same on all six plots) are direct measurements of the 
galaxy density. Note that estimation from brighter galaxies allows extrapolation to higher 
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Fig. 16. — Extrapolation capabilities of order statistics. The plots show the co-moving 
number density of LRGs as a function of redshift calculated in redshift bins of Az = 0.005. 
Different plots correspond to different absolute magnitude cuts in the SDSS r band. Dashed 
lines represent the galaxy density directly calculated from galaxy counts, while the solid 
lines show the order statistics estimates from the magnitude of the 100*'' brightest galaxy 
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Fig. 17. — Probability distribution of the magnitude difference of the first and the second 
and third most luminous galaxy (M12 - thick line, M13 - thin line) in a sphere with a radius 
of r = 10 Mpc around simulated BCGs. The small vertical bars mark the expectation values. 
Results from a Monte-Carlo simulation of 10,000 runs. 




